On the Computation of Maximal-Correlated Cuboids Cells

نویسندگان

  • Ronnie Alves
  • Orlando Belo
چکیده

The main idea of iceberg data cubing methods relies on optimization techniques for computing only the cuboids cells above certain minimum support threshold. Even using such approach the curse of dimensionality remains, given the large number of cuboids to compute, which produces, as we know, huge outputs. However, more recently, some efforts have been done on computing only closed cuboids. Nevertheless, for some of the dense databases, which are considered in this paper, even the set of all closed cuboids will be too large. An alternative would be to compute only the maximal cuboids. However, a pure maximal approaching implies loosing some information, this is one can generate the complete set of cuboids cells from its maximal but without their respective aggregation value. To play with some “loss of information” we need to add an interesting measure, that we call the correlated value of a cuboid cell. In this paper, we propose a new notion for reducing cuboids aggregation by means of computing only the maximal-correlated cuboids cells, and present the M3C-Cubing algorithm that brings out those cuboids. Our evaluation study shows that the method followed is a promising candidate for scalable data cubing, reducing the number of cuboids by at least an order of magnitude or more in comparison with that of closed ones.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Computing Data Cubes without Redundant Aggregated Nodes and Single Graph Paths: The Sequential MCG Approach

In this paper, we present a novel full cube computation and representation approach, named MCG. A data cube can be defined as a lattice of cuboids. In our approach, each cuboid is seen as a set of sub-graphs. Redundant suffixed nodes in such sub-graphs are quite common, but their elimination is a hard problem as some previous cube approaches demonstrate. MCG approach computes a data cube in two...

متن کامل

Approximating Smallest Containers for Packing Three-Dimensional Convex Objects

We investigate the problem of computing a minimum volume container for the non-overlapping packing of a given set of three-dimensional convex objects. Already the simplest versions of the problem are NPhard so that we cannot expect to find exact polynomial time algorithms. We give constant ratio approximation algorithms for packing axis-parallel (rectangular) cuboids under translation into an a...

متن کامل

Functional Speed Reserve as a Proxy for the Anaerobic Speed Reserve Using the Critical Speed Concept

Background: Although maximal sprint speed (MSS) and the anaerobic speed reserve (ASR) provides valuable information about the speed profile of athletes, these parameters fall short of providing important information about sub-maximal metabolic thresholds. The only field test that can offer an estimate of a sub-maximal metabolic threshold is the 3-minute all-out test for running (3MT) which deli...

متن کامل

A new heuristic algorithm for cuboids packing with no orientation constraints

The three-dimensional cuboids packing is NP-hard and finds many applications in the transportation industry. The problem is to pack a subset of cuboid boxes into a big cuboid container such that the total volume of the packed boxes is maximized. The boxes have no orientation constraints, i.e. they can be rotated by 90◦ in any direction. A new heuristic algorithm is presented that defines a conc...

متن کامل

Stream Cube : An Architecture 1 for Multi - Dimensional Analysis of Data

Real-time surveillance systems, telecommunication systems, and other dynamic environments often 19 generate tremendous (potentially infinite) volume of stream data: the volume is too huge to be scanned multiple 20 times. Much of such data resides at rather low level of abstraction, whereas most analysts are interested in relatively 21 high-level dynamic changes (such as trends and outliers). To...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006